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Abstract. We address the synthesis of distributed control 30 
policies to enable a swarm of homogeneous mobile sensors 
to maintain a desired spatial distribution in a geophysical 
flow environment, or workspace. In this article, we assume 
the mobile sensors (or robots) have a "map" of the environ- 
ment denoting the locations of the Lagrangian coherent struc- 35 
tures or LCS boundaries. Based on this information, we de- 
sign agent-level hybrid control policies that leverage the sur- 
rounding fluid dynamics and inherent environmental noise 
to enable the team to maintain a desired distribution in the 
workspace. We establish the stability properties of the en- 40 
semble dynamics of the distributed control policies. Since 
realistic quasi-geostrophic ocean models predict double-gyre 
flow solutions, we use a wind-driven multi-gyre flow model 
to verify the feasibility of the proposed distributed control 
strategy and compare the proposed control strategy with a 45 
baseline deterministic allocation strategy. Lastly, we validate 
the control strategy using actual flow data obtained by our 
coherent structure experimental testbed. 



1 Introduction 

Recent years have seen the use of autonomous underwater 
and surface vehicles (AUVs and AS Vs) for persistent surveil- 
lance of the ocean to study the dynamics of various bio- 
logical and physical phenomena, such as plankton assem- 



blages (|Caron et al. 



2008 ), temperature and salinity profiles 



([Lynch et al.||2008HWu and Zhang|[20TT] [Sydn ey and Paley 
20111, and the onset of harmful algae blooms ( Zhang et al. 



mostly focused on the deployment of single, or small num- 
bers of, AUVs working in conjunction with a few stationary 
sensors and AS Vs. While data collection strategies in these 
works are driven by the dynamics of the processes they study, 
few existing approaches explicitly account for the surround- 
ing fluid dynamics ( [Lolla et aL 2012| DeVries and P aley 



|2011| ). In fact, most existing works treat the effect of the 



surrounding fluid as external disturbances (Das et al. 2011 



Williams and Sukhatme 2012), largely because of our lim- 



2007; |Chen etaL] [20081 [Das et al.||20TT) . These studies have 



ited understanding of the complexities of ocean dynamics. 

Geophysical flows are naturally stochastic and aperiodic, 
yet exhibit coherent structure. Coherent structures are of sig- 
nificant importance since knowledge of them enables the pre- 
diction and estimation of the underlying geophysical fluid 
dynamics. In realistic ocean flows, these time-dependent co- 
herent structures, or Lagrangian coherent structures (LCS), 
are similar to separatrices that divide the flow into dynami- 
cally distinct regions. LCS are extensions of stable and un- 
stable manifolds to general time-dependent flows (Haller and 
Yuan, 2000), and they carry a great deal of global informa- 
tion about the dynamics of the flows. For two-dimensional 
(2D) flows, ridges of locally maximal finite-time Lyapunov 
exponent (FTLE) ( [Shadden et al.| |2005|) valu es correspond, 
to a good approximation (though see (Haller |2011| )), to La- 
grangian coherent structures. LCS have been shown to coin- 
cide with optimal trajectories in the ocean which minimize 
the energy and the time needed to traverse from one point to 
another ( [Inane et al.| [20051 |Senatore and Ross[ [200§] ). Fur- 
thermore, for monitoring of oceanic processes, e.g., to im- 
prove weather and climate forecasting or to better understand 
contaminant transport, one would like to use autonomous 
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sensors to measure a variety of quantities of interest. A draw- 
back to operating sensors in time-dependent and stochas- 
tic environments like the ocean is that the sensors will es- 
cape from their monitoring region of interest with some fi- 
nite probability. Since the LCS are inherently unstable and 
denote regions of the flow where escape events occur with 



higher probability ( For goston et al. 201 1 ), knowledge of the 
LCS are of paramount importance in maintaining a sensor in 
a particular monitoring region. 

Recently, Hsie h et al. ( 2012| ) described a collaborative 
control strategy for a class of autonomous underwater ve- 
hicles (AUVs) to track the coherent structures and manifolds 
on flows. In this paper, we now present a distributed control 
strategy for AUVs and/or mobile sensing resources to main- 
tain a desired spatial distribution assuming a priori knowl- 
edge of manifold/coherent structure locations within a re- 
gion of interest. Specifically, we devise agent-level control 
policies which allow individual AUVs to leverage the sur- 
rounding fluid dynamics and inherent environmental noise 
to navigate from one dynamically distinct region to another 
in a fluidic environment. The result is a distributed alloca- 
tion strategy that minimizes the overall control-effort em- 
ployed by the team to maintain the desired spatial forma- 
tion for environmental monitoring applications. While this 120 
problem can be formulated as a multi-task (MT), single-robot 
^SR), tim e-extended assignment (TA) problem (Gerke yand| 
Mataric, 2004 ), existing approaches do not take into account 
the effects of fluid dynamics coupled with the inherent en- 



vironmental noise (Gerkey and Mataric, 200 2{ |Dias et al. 
2006 



, , Hsi eh et al.[ |2008[ |Berman et al. 



2006| |Dahl et al.[ [20 

As such, the novelty of this work lies in the use of 
nonlinear dynamical systems tools and recent results in dis- 1 



tributed robotic tracking of manifolds and structures (Hsieh i = 0, 



|et al. |2012| ) to synthesize distributed control policies that en- 
ables agents to maintain a desired distribution in a fluidic 
environment. 

The paper is structured as follows: We formulate the prob- 
lem and outline key assumptions in Section [2] The develop- 130 
ment of the distributed control strategy is presented in Sec- 
tion[3]and its theoretical properties are analyzed in Section]?] 
Section [5] presents our simulation methodology and results. 
We conclude with a discussion of our results, and directions 
for future work in Sections [6] and [^respectively. 1 35 

2 Problem Formulation 

Consider the deployment of N mobile sensing resources 
(AUVs/ASVs) to monitor M regions in the ocean. The ob- 140 
jective is to synthesize agent-level control policies that will 
enable the team to autonomously maintain a desired distri- 
bution across the M regions in a dynamic and noisy fluidic 
environment. We assume the following kinematic model for 
each AUV: " 145 




(a) 



(b) 



Fig. 1. (a) Vector field and (b) FTLE field of the model given by 
{2} for two gyres with A = 10, fi = 0.005, e = 0, ip = 0, J = 0.01, 
and s — 50. LCS are characterize by regions with maximum FTLE 
measures (denoted by red). In 2D flows, regions with maximum 
FTLE measures correspond to 1-D curves. 



where q/e = [x k: Vk, z k ] T denotes the vehicle's position, u k 
denotes the 3 x 1 control input vector, and Vf(qj c ) denotes 
the velocity of the fluid experienced/measured by the k th ve- 
hicle. 

In this article, we limit our discussion to 2D planar flows 
and motions and thus we assume Zk is constant for all k. 
As such, vj(q) is given by a 2D vector field denoted by 
F(q) such that F z (q) = for all q. Since realistic quasi- 
geostrophic ocean models exhibit multi-gyre flow solutions, 
we assume v/(q) = F(q) where F(q) is provided by the 2D 
wind-driven multi-gyre flow model given by 



. . , f(x,t) . . y. . x 

x — —TrAsmiir )cos(7r-J — fix + r/im, 

s s 

. . f(x,t) N , y \df , s. 

y = 7rAcos(7r )sin(7r— )- (iy + 772 ft) , 

s s ax 



x -\- (J) 

f(x,i) = x + ss'mtn ) sintut + ip). 
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(2a) 

(2b) 
(2c) 
(2d) 



When e = 0, the multi-gyre flow is time-independent, while 
for e ^ 0, the gyres undergo a periodic expansion and con- 
traction in the x direction. In Eq. ([2]), A approximately de- 
termines the amplitude of the velocity vectors, u/2tt gives 
the oscillation frequency, e determines the amplitude of the 
left-right motion of the separatrix between the gyres, ip is 
the phase, /i determines the dissipation, s scales the dimen- 
sions of the workspace, and rji (t) describes a stochastic white 
noise with mean zero and standard deviation a = V2I, for 



q k = u k + v f (q k ) he {!,..., n}, 



(1) 



noise intensity /. Figs . 1 1 (a)| and 1 1 (b) | show the vector field of 
a two-gyre model and the corresponding FTLE curves for the 
time independent case. 

Let W denote an obstacle-free workspace with flow dy- 
namics given by ([2]). We assume a tessellation of W such 
that the boundaries of each cell roughly corresponds to sta- 
ble/unstable manifolds or LCS curves quantified by max- 
imum FTLE ridges as shown in Fig. [2] A tessellation of 
the workspace along boundaries characterized by maximum 
FTLE ridges makes sense since they separate regions within 
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Fig. 2. Two examples of cell decomposition of the region of interest 
based on (2]). (a) A 4 x 4 time independent grid of gyres with A — 
0.5, ii 0.005, e = 0, i\) = 0, I = 35, and s = 20. The stable and 
unstable manifolds of each saddle point in the system is shown by 
the black arrows, (b) An FTLE based cell decomposition for a time- 195 
independent double-gyre system with the same parameters as Fig. 



the flow field that exhibit distinct dynamic behavior and de- 200 
note regions in the flow field where more escape events may 
occur probabilistically (Forgost on et al. 2011| ). In the time 
independent case, these boundaries correspond to stable and 
unstable manifolds of saddle points in the system. The man- 
ifolds can also be characterized by maximum FTLE ridges 205 
where the FTLE is computed based on a backward (attract- 
ing structures) or forward (repelling structures) integration 
in time. Since the manifolds demarcate the basin bound- 
aries separating the distinct dynamical regions, they are also 
regions that are uncertain with respect to velocity vectors 

within a neighborhood of the manifold. Therefore, switching 

& ' &210 

between regions in neighborhoods of the manifold is influ- 
enced both by deterministic uncertainty as well as stochas- 
ticity due to external noise. 

Given an FTLE-based cell decomposition of W, let Q = 
(V,£) denote an undirected graph whose vertex set V = 
{ Vi , . . . , Vm} represents the collection of FTLE-derived cells 
in W. An edge exists in the set £ if cells Vi and Vj share a 215 
physical boundary or are physically adjacent. In other words, 
Q serves as a roadmap for W. For the case shown in Fig. 



2(a) adjacency of an interior cell is defined based on four 



neighborhoods. Let Ni denote the number of AUVs or mo- 
bile sensing resources/robots within Vi. The objective is to 220 
synthesize agent-level control policies, or u^, to achieve and 
maintain a desired distribution of the N agents across the M 
regions, denoted by N = [N\ , . . . , Nm] T , in an environment 
whose dynamics are given by ([2]). 

We assume that robots are given a map of the environ- 225 
ment, Q, know the locations of the LCS boundaries for each 
Vi, have the ability to communicate with other robots col- 
lated within the same Vi, and N is known a priori. These 
assumptions are necessary to enable a prioritization scheme 
within each Vi based on an individual robot's escape like- 230 
lihoods in order to achieve the desired allocation. The pri- 



oritization scheme will allow robots to minimize the control 
effort expenditure as they move within the set V. We describe 
the methodology in the following section. 

3 Methodology 



We propose to leverage the environmental dynamics and the 
inherent environmental noise to synthesize energy-efficient 
control policies for a team of mobile sensing resources/robots 
to maintain the desired allocation in W at all times. We 
assume each robot has a map of the environment. In our 
scenario, this assumption translates to providing the robots 
knowledge of LCS boundaries that define each ViinQ. While 
this assumption may seem unrealistic, since LCS curves sep- 
arate W into regions with distinct flow dynamics, this is 
analogous to providing autonomous ground or aerial vehi- 
cles with a map of the environment which is often obtained a 
priori. In a fluidic environment, the map can be obtained by 
locating the maximum FTLE ridges based on historical data 
and tracking these boundaries, potentially in real-time, us- 
ing a strategy similar to the one found in Hsi eh et al. ( 2012 ). 
Thus, we assume each robot has a map of the environment 
and has the ability to determine the direction it is moving in 
within the global coordinate frame, i.e., the ability to local- 
ize. 



3.1 Controller Synthesis 

Consider a team of N robots initially distributed across M 
gyres/cells. Since the objective is to achieve a desired allo- 
cation of N at all times, the proposed strategy will consist 
of two phases: an auction to determine which robots within 
each Vi should be tasked to leave/stay and an actuation phase 
where robots execute the appropriate leave/stay controller. 

3.1.1 Auction Phase 

The purpose of the auction phase is to determine whether 
Ni(t) > Ni and to assign the appropriate actuation strategy 
for each robot within Vi. Let Qi denote an ordered set whose 
elements provide robot identities that are arranged from high- 
est escape likelihoods to lowest escape likelihoods from Vi . 

In general, to first order we assume a geometric measure 
whereby the escape likelihood of any particle within Vi in- 
creases as it approaches the boundary of Vi, denoted as dVi 
(For goston et al.[ [201 lj ). Given W, with dynamics given by 
(2j, consider the case when e = and 1^0, i.e., the case 
when the fluid dynamics is time independent in the pres- 
ence of noise. The boundaries between each Vi are given by 
the stable and unstable manifolds of the saddle points within 



W as shown in Fig. |2(a)| While there exists a stable attrac- 
tor in each Vi when / = 0, the presence of noise means that 
robots originating in Vi have a non-zero probability of land- 
ing in a neighboring gyre Vj where G £. Here we assume 
that robots experience the same escape likelihoods in each 
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gyre/cell and assume that Pk(-ii\i), the probability that a 
robot escapes from region i to an adjacent region, can be es- 
timated based on a robot's proximity to a cell boundary with 



k$Q ijj aaa\\q k -dVi\\ 00 <d m 
or Ik £ Vi 



^ome ass umption of the environmental noise profile (Forgos- 
[ton etal.j|2gTT> . 

In other words, Qi = . . . , k^.} such that — 
OViWoo < Uka-dViWoo < ... < \\qNt-dViWoo where |MU 
denotes the infinity norm. The set Qi provides the prior- 
itization scheme for tasking robots within Vi to leave if 
Ni(t) > Ni. The assumption is that robots with higher es- 
cape likelihoods are more likely to be "pushed" out of Vi by 
the environment dynamics and will not have to exert as much 
control effort when moving to another cell, minimizing the 
overall control effort required by the team. 270 

In general, a simple auction scheme can be used to de- 
termine Qj in a distributed fashion by the robots in Vi ( |Dias| 
|etal.||2006| ). If Ni (t) > Ni , then the first Ni - Ni elements of 
Qi, denoted by Qi L c Qi, are tasked to leave Vi. The number 
of robots in Vi can be established in a distributed manner in a 275 
similar fashion. The auction can be executed periodically at 
some frequency 1/T a where T a denote the length of time be- 
tween each auction and should be greater than the relaxation 
time of the AUV/ASV dynamics. This step is summarized in 
Algorithm [T] 

Algorithm 1 Auction Phase 



9: 
10: 
11: 
12: 



if ElapsedTime T a then 

Execute auction and determine Ni (£) and Qi 
if Ni(t) >N then 
if k G Ql then 

Uk — > UiNk G Ql 
else 

u k Us 
end if 
else 

Uk — > UsVk G Qi 
end if 
end if 



3.1.2 Actuation Phase 

295 

For the actuation phase, individual robots execute their as- 
signed controllers depending on whether they were tasked to 
stay or leave during the auction phase. As such, the individ- 
ual robot control strategy is a hybrid control policy consisting 
of three discrete states: a leave state, Ul, and two stay 
states, Us A and Us P . Robots who are tasked to leave will 
execute Ul until they have left Vi or until they have been 300 
once again tasked to stay. Robots who are tasked to stay 
will execute U Sp if \\q kl ~ dViW^ > d min and U Sa other- 
wise. In other words, if a robot's distance to the cell boundary 
is below some minimum threshold distance d m i n , then the 
robot will actuate and move itself away from the boundary of 305 
Vi. If a robot's distance to dVi is above d m i n , then the robot 




HQ* 
andlgfc-dVilloc > d mi n 



k i Qi L 
and||g fc -(?F i || 00 >rf m 



Fig. 3. Schematic of the single-robot hybrid robot control policy. 

will execute no control actions. Robots will execute Us A un- 
til they have reached a state where \\qk 1 — dVi\\oo > d m i n or 
until they are tasked to leave at a later auction round. Sim- 
ilarly, robots will execute Us P until either — dV^loo < 
d m i n or they are tasked to leave. The hybrid robot control 
policy is given by 



Us A (<lk) = ~Ui x c 
U Sp (q k ) = 0. 



F(q k ) 



\\n<*kW 



(3a) 

(3b) 
(3c) 



Here, uoi = [0, 0, 1] T denotes counterclockwise rotation with 
respect to the center of Vi, with clockwise rotation being de- 
noted by the negative and c is a constant that sets the linear 
speed of the robots. The hybrid control policy generates a 
control input perpendicular to the fluid velocity measured by 
robot k and pushes the robot away from the center of Vi if 
Ul is selected, towards the center of Vi if U s A is selected, 
or results in no control input if U s P is selected The hybrid 
control policy is summarized in Fig. [3] 

In general, the Auction Phase is executed at a frequency 
of 1/T a which means robots also switch between controller 
states at a frequency of l/T a . To further reduce actuation ef- 
forts exerted by each robot, it is possible to limit a robot's ac- 
tuation time to a period of time T c <T a . Such a scheme may 
prolong the amount of time required for the team to achieve 
the desired allocation, but may result in significant energy- 
efficiency gains. We further analyze the proposed strategy in 
the following sections. 

4 Analysis 

In this section, we discuss the theoretical feasibility of the 
proposed distributed allocation strategy. Instead of the tradi- 
tional agent-based analysis, we employ a macroscopic anal- 
ysis of the proposed distributed control strategy given by Al- 
gorithm [T] and ([3]) similar to that found in Mather and Hsieh 
( |2011| ). For a team of N robots, each executing the same hy- 
brid control strategy shown in Fig. [3] we represent the en- 
semble dynamics as a polynomial stochastic hybrid system 
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(pSHS). The advantage of this approach is that it allows the 
use of moment closure techniques to model the time evolu- 
tion of the distribution of the team across the various cells. 

310 The system state is given by N(t) = [Ni (£), . . . , NM(t)] T '. 
As the team distributes across the M regions, the rate in 
which robots leave a given Vi can be modeled using con- 
stant transition rates. For every edge G £, we assign a 
constant > such that gives the transition probability 

315 per unit time for a robot from Vi to land in Vj . Different from 
Mather and Hsieh (2011), the a^ 's are a function of the pa- 
rameters (c, T c , and T a ) of the individual robot control policy 
([3]), the dynamics of the surrounding fluid, and the inherent 
noise in the environment. Furthermore, is a macroscopic 

320 description of the system and thus a parameter of the ensem- 
ble dynamics rather than the agent-based system. As such, 
the macroscopic analysis is a description of the steady-state 3 eo 
behavior of the system and becomes exact as N approaches 
infinity. 

325 Given Q and the set of a^ 's, we model the ensemble dy- 



namics as a set of transition rules of the form: 



N 



e£. 



(4r 



The above expression represents a stochastic transition rule 
with ciij as the per unit transition rate and Ni (t) and Nj (t) as 
discrete random variables. In the robotics setting, equation 
^ implies that robots at Vi will move to Vj with a rate of 370 
dijNi. We assume the ensemble dynamics is Markovian and 
note that in general ^ dji and encodes the inverse of 
the average time a robot spends in Vi. 

Given ([4]) and employing the extended generator we can 
obtain the following description of the moment dynamics of 375 
the system: 



|E[N] = AE[N] 



(5) 



340 .where [A]i 



H 



J JJl 



and [A]i 



E 



(i,j)es a v 



(Mather and 



sieh| 2011 ). It is important to note that A is a Markov pro 



cess matrix and thus is negative semidefinite. This coupled 
with the conservation constraint J2 i Ni = N leads to expo- 
nential stability of the system given by §5§ ( Klavins 2010 ). 

In this work, we note that a^ 's can be determined exper- 
imentally after the selection of the various parameters in the 
distributed control strategy. While the a^ 's can be chosen to 
enable the team of robots to autonomously maintain the de- 
sired steady- state distribution ( [Hsieh et aT!|2008| ), extraction 
of the control parameters from user specified transition rates 
is a direction for future work. Thus, our main result in given 

by ~ 3 

Theorem 1 Given a team ofN robots with kinematics given 
by ([T]) and v f given by ([2]), the distributed allocation strategy 
given by Algorithm^and ([3}, at the ensemble level is stable 
and achieves the desired allocation strategy. 



We refer the interested reader to Mather and Hsieh ( 2011| ) for 
the details of the proof. 
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Fig. 4. Three desired allocation of the team of N — 500 mobile 
sensing resources/robots, (a) A Ring pattern allocation, (b) a Block 
pattern allocation, and (c) an L- shaped pattern allocation. Each box 
represents a gyre and the number within the desired number of 
robots. 



5 Simulation Results 

We illustrate the proposed control strategy described by Al- 
gorithm ([T]) and ([3]) with some simulation results using both 
the time-invariant and time- varying model given by ([2]). In 
our simulations the environmental flow field is given by the 
2D wind-driven multi-gyre flow model ([2]). In our simula- 
tions, we assume W consists of a 4 x 4 grid of gyres such 



that each Vi G V corresponds to a gyre as shown in Fig. 2(a) 



To ensure that the total number of robots remain constant, let 
W c denote the complement of the workspace. As such, our 
workspace consists of M + 1 = 17 monitoring regions. 

We assume N — 500 and T a = 10 and initialize the team 
of 500 robots randomly across the M gyres. Each simulation 
was performed for a long enough period of time until steady- 
state has been reached. We considered three different alloca- 
tions across the grid of 4 x 4 gyres forming a Ring, a Block, 
and an L- Shape pattern as shown in Fig. [4] For the ring pat- 
tern, we consider the case when the actuation was applied for 
T c = fT a amount of time where / = 0.1,0.2, . . . , 1.0. The 
Block and L- Shape patterns we considered the cases when 
T c = 0.5T a and T c = T a . 



380 5.1 In Time-Invariant Flows 

For time-invariant flows, we assume e = 0,A = 0.5,s = 20, 
H = 0.005, and / = 35 in ([2]). The final population distribu- 
tion of the team for the case with no controls and the cases 
with controls for each of the patterns are shown in Fig. [5] 

For each pattern and corresponding value of T c , we ex- 
ecute a total of five runs. To evaluate the performance of 
the proposed strategy, we compute the average root mean 
squared errors (RMSE) of the population differences over all 
M + 1 cells within the total space over all runs. Additionally, 
we consider the total control expenditure by the team. The to- 
tal control expenditure is computed by integrating the magni- 
tude of the average control velocity over time for each robot. 
We compare our results to a baseline deterministic alloca- 
tion strategy where the desired allocation is pre-computed 
and individual robots follow fixed trajectories when navigat- 
ing from one gyre to another. For this baseline case, robots 
travel in straight lines at fixed speeds using a simple PID tra- 
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(a) No Control 



(b) Ring 





(c) Block 



(d) L-Shape 



Fig. 5. Histogram of the final allocations in the time-invariant 
flow field for the swarm of (a) passive robots exerting no con- 
trols and robots exerting control forming the (b) Ring pattern with 
T c = 0.8T a at t = 450, (c) Block pattern with T c = T a at t = 450, 
and (d) L-shape pattern with T c = 0.5T a at t = 450. 



Table 1. Summary of RMSE's for each simulation pattern at t=450 
with the time-invariant flow field. 



jectory follower and treat the surrounding fluid dynamics as 
an external disturbance source. The RMSE results for all pat- 
terns are summarized in Table [T] and Figs.[6][8] 

425 

5.2 In Time-Varying Flows 

For the time- varying, periodic flow, we assume A = 0.5, 
s = 20, /i = 0.005, I = 35, p = 2s, <j> = and i/j = in 0. 
Additionally, we considered the performance of our control 430 
strategy for different values of lj and e with T c = 8. In all 
these simulations, the boundaries of each Vi is given by the 
FTLE ridges computed for the time-independent system. The 
final population distribution of the team for the case with no 
controls and the cases with controls for each of the patterns 435 
are shown in Fig. [9] The final population RMSE for the cases 
with different uj and e values for the Ring and L-shape pat- 



terns are shown in Fig. 10 These figures show the average 
of 10 runs for each uj and e pair. In each of these runs, the 
swarm of mobile sensors were initially randomly distributed 440 



within the grid of 4 x 4 cells. Finally, Fig. 1 1 and 12 shows the 
population RMSE as a function of time and the average con- 
trol effort for an individual robot for the Ring and L-shape 
patterns respectively. 




T c 


2 


5 


8 


9 


10 


Baseline 


Ring Pattern 


12.99 


5.98 


3.45 


3.49 


3.66 


4.09 


Block Pattern 




11.21 






12.72 




L Pattern 




30.09 






30.45 





Fig. 6. Comparison of RMSE and Total Control Effort for the 
Ring pattern for different T c with the baseline controller with time- 
invariant flows. 



5.3 In Experimental Flows 

Using our 0.6m x 0.6m x 0.3m experimental flow tank 
equipped with a grid of 4x3 set of driving cylinders, we gen- 
erated a time-invariant multi-gyre flow field. Particle image 
velocimetry (PIV) was used to extra the surface flows at 7.5 
Hz resulting in a 39x39 grid of velocity measurements. The 
data was collected for a total of 60 sec. Fig. shows the top 
view of our experimental testbed and the resulting flow field 
obtained via PIV. Using this data, we simulated a swarm of 
500 mobile sensors executing the control strategy given by 
Eq.@. 

To determine the appropriate tessellation of the 
workspace, we averaged the positions of the LCS ridges 
obtained for each frame of the velocity field over time. This 
resulted in the discretization of the space into a grid of 4 x 3 
cells. Each cell corresponds to a single gyre as shown Fig. 
14 The cells of primary concern are the central pair and the 
remainder boundary cells were not used to avoid boundary 
effects and to allow robots to escape the center gyres in all 
directions. The robots were initially uniformly distributed 
across the two center cells and all 500 robots were tasked 
to stay within the upper center cell. When no control effort 
is exerted by the robots, the final population distribution 



achieved by the team is shown in Fig. 15(a) With controls, 
the final population distribution is shown in Fig. |15(b)| The 
control strategy was applied assuming T c /T a = 0.8. The 
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final RMSE for different values of c in Eq. ^ and T a is 
shown in Fig. 16(a) and RMSE as a function of time for 



different values of c and T a are shown in Fig. 16(b) 



6 Discussion 

In time-invariant flows, we note that for large enough T c , 
our proposed distributed control strategy performs compa-4?o 
rable to the baseline controller both in terms of steady- state 
error and convergence time. As T c decreases, less and less 
control effort is exerted and thus it becomes more and more 
difficult for the team to achieve the desired allocation. This is 
confirmed by both the RMSE results summarized in Table 
and Fig. |6(a)|8(a)| Furthermore, while the proposed control 
strategy does not beat the baseline strategy as seen in Fig. 



6(a) it does come extremely close to matching the baseline 



strategy performance, while requiring much less control ef- 



fort as shown in Fig. [6(b)] even at high duty cycles, i.e., when4so 
T c /T a > 0.5. 

More interestingly, we note that executing the proposed 
control strategy at 100% duty cycle, i.e., when T c = T a , in 



time-invariant flows did not always result in better perfor- 
mance. This is true for the cases when T c = 0.5T a = 5 for 
the Block and L-shaped patterns shown in Fig. |7(a)| and |8(a)| 
In these cases, less control effort yielded improved perfor- 
mance. However, further studies are required to determine 
the critical value of T c when less control yields better over- 
all performance. In time-invariant flows, our proposed con- 
troller can more accurately match the desired pattern while 
using approximately 20% less effort when compared to the 
baseline controller. 

In time- varying, periodic flows we note that our proposed 
control strategy is able to achieve the desired final allocation 
even at 80% duty cycle, i.e., T c = 0.ST a . This is supported 
by the results shown in Fig.[l0] In particular, we note that the 
proposed control strategy performs quite well for a range of 
uj and e parameters for both the Ring and L- shape patterns. 
While the variation in final RMSE values for the Ring pattern 
is significantly lower than the L- shape pattern, the variations 
in final RMSE values for the L-shape are all within 10% of 
the total swarm population. 
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(a) No Control 



(b) Ring 





(c) L-Shape 

Fig. 9. Histogram of the final allocations in periodic flows, with pa- 
rameters of uj = and e = 5, for the swarm of (a) passive robots 
exerting no controls and robots exerting control forming the (b) 
Ring pattern with T c = 0.8T a at t = 450, and (c) L-Shape pattern 
withT c = 0.5T a att = 450. 




(a) Ring 



(b) L-Shape 



Fig. 10. Final population RMSE for different values of uj and e 
with T c — 8 and d m in — 4. The remaining model parameters are 
the same as in the time-invariant case. 



3fc 



Fig. 11. Comparison of RMSE and Control Effort over time for 
select uj and e pairs for the Ring pattern in periodic flows. 




Lastly, the results obtained using the experimental flow 
field shows that the proposed control strategy has the poten- 
tial to be effective in realistic flows. However, the resulting 
performance will require good matching between the amount 
of control effort a vehicle can realistic exert, the frequency in 
which the auctions occur within a cell, and the time scales of 



490 the environmental dynamics as shown in in Figs. 16(a) and 



16(b) This is an area for future investigation. 



7 Conclusions and Future Outlook 

In this work, we presented the development of a distributed 
hybrid control strategy for a team of robots to maintain a de- 
sired spatial distribution in a stochastic geophysical fluid en- 




Fig. 12. Comparison of RMSE and Control Effort over time for 
select uj and e pairs for the L- shape pattern in periodic flows. 
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(b) 



Fig. 13. (a) Experimental setup of flow tank with 12 driven cylin- 
ders, (b) Flow field for image (a) obtained via particle image ve- 
locimetry (PIV). 





% 






Hi 






V 



Fig. 14. FTLE field for the mean of the experimental data set over 
time. The field is discretized into a grid of 4 x 3 cells whose bound- 
aries are shown in black. 




(a) 



(b) 



Fig. 15. Population distribution for a swarm of 500 mobile sensors 
over a period of 60 sec (a) with no controls, i.e., passive, and (b) 
with controls. 





(b) 



Fig. 16. (a) Final RMSE for different values of c (Eq. [3} and T a 
using the experimental flow field. T c /T a = 0.8 is kept constant 
throughout, (b) RMSE over time for select c and T a parameters 
on an experimental flow field. The duty cycle T c /T a = 0.8 is kept 
constant throughout. 



vironment. We assumed robots have a map of the workspace 
which in the fluid setting is akin to having some estimate of 
the global fluid dynamics. This can be achieved by knowing 
the locations of the material lines within the flow field that 
separate regions with distinct dynamics. Using this knowl- 
edge, we leverage the surrounding fluid dynamics and inher- 
ent environmental noise to synthesize energy efficient con- 
trol strategies to achieve a distributed allocation of the team 
to specific regions in the workspace. Our initial results show 
that using such a strategy can yield similar performance as 
deterministic approaches that do not explicitly account for 
the impact of the fluid dynamics while reducing the control 
effort required by the team. 

For future work we are interested in using actual ocean 
flow data to further evaluate our distributed allocation strat- 
egy. Specifically, we are interested in using more compli- 
cated flow models including a bounded single-layer PDE 
ocean model (Forgos ton et al.| 201 a m ulti-layer PDE 
ocean model ( |Wang et al.||2009[ Lolla et al.|[20T2| ), and re- 
alistic 2D and 3D unbounded flow models provided by the 
Navy Coastal Ocean Model (NCOM) database. Particularly, 
we are interested in extending our strategy to non-periodic, 
time- varying flows. In addition, we are currently developing 
an experimental testbed capable of generating complex 2D 
flows in a controlled laboratory setting. The objective is to 
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be able to evaluate the proposed control strategy using ex- 
perimentally generated flow field data whose dynamics are 
similar to realistic ocean flows. Finally, since our proposed 575 
strategy requires robots to have some estimate of the global 
525 fluid dynamics, another immediate direction for future work 
is to determine how well one can estimate the fluid dynam- 
ics given knowledge of the locations of Lagrangian coherent 

580 

structures (LCS) in the flow field. 
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